Application of Deep Reinforcement Learning in Maneuver Planning of Beyond-Visual-Range Air Combat

نویسندگان

چکیده

Beyond-visual-range (BVR) engagement becomes more and popular in the modern air battlefield. The key difficulty for pilots fight is maneuver planning, which reflects tactical decision-making capacity of both sides determinates success or failure. In this paper, we propose an intelligent planning method BVR combat with using improved deep Q network (DQN). First, a basic environment builds, mainly includes flight motion model, relative model missile attack model. Then, create decision framework agent interaction environment. Basic perceptive variables are constructed agents to form continuous state space. Also, considering threat each side constraint airfield, reward function designed training. Later, introduce training algorithm perceptional situation layers value fitting replace policy DQN. Based on long short-term memory (LSTM) cell, layer can convert high-dimensional perception situation. does well mapping action. Finally, three scenarios testing. Simulation result shows avoid enemy gather own advantages target. It also proves models methods valid be realized.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Case-Based Behavior Recognition in Beyond Visual Range Air Combat

An unmanned air vehicle (UAV) can operate as a capable team member in mixed human-robot teams if it is controlled by an agent that can intelligently plan. However, planning effectively in a beyond-visual-range air combat scenario requires understanding the behaviors of hostile agents, which is challenging in partially observable environments such as the one we study. In particular, unobserved h...

متن کامل

Active Behavior Recognition in Beyond Visual Range Air Combat

Accurately modeling uncontrolled agents (or recognizing their behavior and intentions) is critical to planning and acting in a multi-agent environment. However, behavior recognition systems are only as good as their observations. Here we argue that acting, even acting at random, can be a critical part of gathering those observations. Furthermore, we claim that acting intelligently via automated...

متن کامل

ACTIVE BEHAVIOR RECOGNITION IN BVR AIR COMBAT Active Behavior Recognition in Beyond Visual Range Air Combat

Accurately modeling uncontrolled agents (or recognizing their behavior and intentions) is critical to planning and acting in a multi-agent environment. However, behavior recognition systems are only as good as their observations. Here we argue that acting, even acting at random, can be a critical part of gathering those observations. Furthermore, we claim that acting intelligently via automated...

متن کامل

A Goal Reasoning Agent for Controlling UAVs in Beyond-Visual-Range Air Combat

We describe the Tactical Battle Manager (TBM), an intelligent agent that uses several integrated artificial intelligence techniques to control an autonomous unmanned aerial vehicle in simulated beyond-visual-range (BVR) air combat scenarios. The TBM incorporates goal reasoning, automated planning, opponent behavior recognition, state prediction, and discrepancy detection to operate in a real-ti...

متن کامل

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3060426